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Abstract. We establish central and local limit theorems for the number of vertices in the largest com- 
ponent of a random d-uniform hypergraph Hd(n,p) with edge probability p = c/y^A, where 
(d — l)" 1 + £ < c < oo. The proof relies on a new, purely probabilistic approach, and is based 
on Stein's method as well as exposing the edges of Hd(n,p) in several rounds. 
Keywords: random graphs and hypergraphs, limit theorems, giant component, Stein's method. 

1 Introduction and Results 

A d-uniform hypergraph H = (V, E) consists of a set V of vertices and a set E of edges, which are subsets 
of V of cardinality d. Moreover, a vertex w is reachable in H from a vertex v if either v = w or there is a 
sequence ei, . . . , efe of edges such that v £ e\, w € e^, and e; n e,_|_i ^ for i = 1, . . . , k — 1. Of course, 
reachability in H is an equivalence relation. The equivalence classes are the components of H, and H is 
connected if there is only one component. 

Throughout the paper, we let V — { 1 , . . . , n} be a set of n vertices. Moreover, if 2 < d is a fixed integer 
and < p = p(n) < 1 is sequence, then we let Hd(n,p) signify a random d-uniform hypergraph with 
vertex set V in which each of the ( *J) possible edges is present with probability p independently. We say 
that Hd(n,p) enjoys some property V with high probability (w.h.p.) if the probability that Hd(n,p) has V 
tends to 1 as n — > oo. If d — 2, then the Hd(n, p) model is identical with the well-known G(n, p) model of 
random graphs. In order to state some related results we will also need a different model Hd(n, m) of ran- 
dom hypergraphs, where the hypergraph is chosen uniformly at random among all <i-uniform hypergraphs 
with n vertices and m edges. 

Since the pioneering work of Erdos and Renyi [8], the component structure of random discrete struc- 
tures has been a central theme in probabilistic combinatorics. In the present paper, we contribute to this 
theme by analyzing the maximum order JV(Hd{n,p)) of a component of Hd(n,p) in greater detail. More 
precisely, establishing central and local limit theorems for JV(Hd(n,p)), we determine the asymptotic 
distribution of Af(Hd(n,p)) precisely. Though such limit theorems are known in the case of graphs (i.e, 
d = 2), they are new in the case of d-uniform hypergraphs for d > 2. Indeed, to the best of our knowledge 
none of the arguments known for the graph case extends directly to the case of hypergraphs (d > 2). There- 
fore, we present a new, purely probabilistic proof of the central and local limit theorems, which, in contrast 
to most prior work, does not rely on involved enumerative techniques. We believe that this new technique 
is interesting in its own right and may have further applications. 



The giant component. In their seminal paper [8], Erdos and Renyi proved that the number of vertices in 
the largest component of G(n, p) undergoes a phase transition as np ~ 1. They showed that if np < 1 — e 
for an arbitrarily small e > that remains fixed as n — > oo, then all components of G(n,p) consist of 
O(lnn) vertices. By contrast, if np > 1 + e, then G(n,p) has one giant component on a linear number 
i?(n) of vertices, while all other components contain only C(ln n) vertices. In fact, in the case 1 + e < c = 
(n — l)p = 0(1) Erdos and Renyi estimated the order (i.e., the number of vertices) of the giant component: 
let Af(G(n, p)) signify the maximum order of a component of G(n, p). Then 

n~ l N(G(n,p)) converges in distribution to the constant 1 — p, (1) 
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where < p < 1 is the unique solution to the transcendental equation p = exp(c(p — 1)). 

A corresponding result was established by Schmidt-Pruzan and Shamir [17] for random hypergraphs 
Hd(n,p). They showed that a random hypergraph Hd(n,p) consists of components of order O(lnn) if 
(d — 1)(jZi)? 5 < 1 — £, whereas Hd(n,p) has a unique large (the giant) component on Qin) vertices 
w.h.p. if (d — 1)(jZi)p > 1 + e. Furthermore, Coja-Oghlan, Moore, and Sanwalani [7] established a 
result similar to (1), showing that in the case (d — 1)(jZi)p > 1 + £ the order of the giant component is 
(1 — p)n + o(n) w.h.p., where < p < 1 is the unique solution to the transcendental equation 

p = cxp(c(/- 1 - 1)). (2) 



Central and local limit theorems. In terms of limit theorems, (1) provides a strong law of large numbers 
for Af(G(n,p)), i.e., it yields the probable value of Af(G(n,p)) up to fluctuations of order o(n). Thus, 
a natural question is whether we can characterize the distribution of J\f(G(n,p)) (or J\f(Hd(n,p))) more 
precisely; for instance, is it true that JV(G(n,p)) "converges to the normal distribution" in some sense? 
Our first result, which we will prove in Section 5, shows that this is indeed the case. 



Theorem 1. Let J c ((d — 1)~ , oo) be a compact interval, and let < p 
such that c = c(n) = 1 ' 
to (2), and set 



p(n) < 1 be a sequence 



t-1 )p S J for all n. Furthermore, let < p = p(n) < 1 be the unique solution 
p[l-p + c(d-l)(p 



(1 -c(d- l)/- 1 ) 2 

Then o-~ 1 (J\f(Hd(n,p)) — (1 — p)n) converges in distribution to the standard normal distribution. 
Theorem 1 provides a central limit theorem for Af(Hd(n,p)); it shows that for any fixed numbers a < b 



(J) 



lim P 



a < Af{H d {n,p))-{\- p)n < fe 



(27T)" 



exp(-£ 2 /2)dt 



(4) 



(provided that the sequence p = p(n) satisfies the above assumptions). 

Though Theorem 1 provides quite useful information about the distribution of Af(Hd{n,p)), the main 
result of this paper is actually a local limit theorem for N(Hd (n,p)), which characterizes the distribution of 
Af(Hd(n, p)) even more precisely. To motivate the local limit theorem, we emphasize that Theorem 1 only 
estimates Af(G(n,p)) up to an error of o(er) = o(^fn). That is, we do obtain from (4) that for arbitrarily 
small but fixed 7 > 



P [\M(H d {n,p)) - u\ <ja] 



1 



2tt(7 



1 a 



cxp 



J<7 



(v-(l-p)n-tf 



2cr 2 



dt. 



(5) 



i.e., we can estimate the probability that Af(Hd{n,p)) deviates from some value v by at most ja. However, 
it is impossible to derive from (4) or (5) the asymptotic probability that Af(Hd{n, p)) hits v exactly. 
By contrast, our next theorem shows that for any integer v such that \v — (1 — p)n\ < 0(a) we have 



P[M(H d (n,p)) 



27TCT 



exp 



(u-(l- p)n) 



2 1 



2a 2 



(6) 



provided that (d — 1)~ 



e < 



O(l). Note that (6) is exactly what we would obtain from (5) if 



we were allowed to set <5 = ^cr(n, p) 1 in that equation. Stated rigorously, the local limit theorem reads as 
follows. 

Theorem 2. Let d > 2 be a fixed integer. For any two compact intervals X C R, J C ({d — 1) , 00), 
and for any S > there exists tiq > such that the following holds. Let p = p(n) be a sequence such that 
c = c(n) = (™T-j)p ^ S for all n, let < p = p(n) < 1 be the unique solution to (2), and let a be as 
in (3). If n > no and if v is an integer such that o~ x (y — (1 — p)n) G T, then 



/2na 



exp 



{v-{l-p)nf 
2a 2 



<P[Af(H d {n,p))=v] < 



2ltO 



cxp 



(v-(\-p)nf 
2a 2 
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Related work. Since the work of Erdos andRenyi [8], the component structure of G(n,p) = H2{n,p) has 
received considerable attention. Stepanov [19] provided central and local limit theorems for Af(G(n,p)), 
thereby proving the d = 2 case of Theorems 1 and 2. In order to establish these limit theorems, he estimates 
the probability that a random graph G(n,p) is connected up to a factor 1 + o(l) using recurrence formulas 
for the number of connected graphs. Furthermore, Barraez, Boucheron, and Fernandez de la Vega [2] 
reproved the central limit theorem for JV(G(n,p)) via the analogy of breadth first search on a random 
graph and a Galton-Watson branching process. In addition, a local limit theorem for Af(G(n,p)) can also 
be derived using the techniques of van der Hofstad and Spencer [9], or the enumerative results of either 
Bender, Canfield, and McKay [5] or Pittel and Wormald [15]. 

Moreover, Pittel [ 14] proved a central limit theorem for the largest component in the G(n,m) model of 
random graphs; G(n, m) is just a uniformly distributed graph with exactly n vertices and m edges. Indeed, 
Pittel actually obtained his central limit theorem via a limit theorem for the joint distribution of the number 
of isolated trees of a given order, cf. also Janson [10]. A comprehensive treatment of further results on the 
components of G(n,p) can be found in [11]. 

In contrast to the case of graphs, only little is known for d-uniform hypergraphs with d > 2; for the 
methods used for graphs do not extend to hypergraphs directly. Using the result [12] on the number of 
sparsely connected hypergraphs, Karonski and Luczak [13] investigated the phase transition of Hd(n,p). 
They established (among other things) a local limit theorem for Af(Hd(n, m)) for m = n/d(d— l) + l and 
1 <C ^ < lt ["" rt which is similar to H d (n,p) at the regime (^Z])p = (d — 1) _1 + w, where n^ 1 / 3 <c 
w = uj(n) <C n -1 / 3 Inn/ In Inn. These results were extended by Andriamampianina, Ravelomanana and 
Rijamamy [1, 16] to the regime I = o(n 1 / 3 ) (lu = o(n~ 2 / 3 ) respectively). 

By comparison, Theorems 1 and 2 deal with edge probabilities p such that (jZi)f> = (d— 1) _1 + ^(1), 
i.e., (jZi)p i s bounded away from the critical point (d — 1) _1 . Thus, Theorems 1 and 2 complement [1, 
13, 16]. The only prior paper dealing with ( jZi)p = (d — l)^ 1 + S7(l) is that of Coja-Oghlan, Moore, and 
Sanwalani [7], where the authors computed the expectation and the variance of J\f(H<i(n,p)) and obtained 
qualitative results on the component structure of Hd{n,p). In addition, in [7] the authors estimated the 
probability that Hd(n,p) or a uniformly distributed d-uniform hypergraph Hd(n, m) with n vertices and 
m edges is connected up to a constant factor. While in the present work we build upon the results on the 
component structure of Hd(n,p) from [7], the results and techniques of [7] by themselves are not strong 
enough to obtain a central or even a local limit theorem for Af{Hd(n, p)). 

Techniques and outline. The aforementioned prior work [1, 12, 13] on the giant component for random 
hypergraphs relies on enumerative techniques to a significant extent; for the basis [1, 12, 13] are results 
on the asymptotic number of connected hypergraphs with a given number of vertices and edges. By con- 
trast, in the present work we employ neither enumerative techniques nor results, but rely solely on proba- 
bilistic methods. Our proof methods are also quite different from Stepanov's [19], who first estimates the 
asymptotic probability that a random graph G(n,p) is connected in order to determine the distribution of 
J\f(Hd(n, p)). By contrast, in the present work we prove the local limit theorem for Af(Hd(n, p)) directly, 
thereby obtaining "en passant" a new proof for the local limit theorem for random graphs G(n,p), which 
may be of independent interest. Besides, the local limit theorem can be used to compute the asymptotic 
probability that G(n,p) or, more generally, Hd(n,p) is connected, or to compute the asymptotic number 
of connected hypergraphs with a given number of vertices and edges (cf. Section 6). Hence, the general 
approach taken in the present work is actually converse to the prior ones [1, 12, 13, 19]. 

The proof of Theorem 1 makes use of Stein 's method, which is a general technique for proving central 
limit theorems [18]. Roughly speaking, Stein's result implies that a sum of a family of dependent random 
variables converges to the normal distribution if one can bound the correlations within any constant-sized 
subfamily sufficiently well. The method was used by Barbour, Karonski, and Rucihski [3] in order to 
prove that in a random graph G(n,p), e.g., the number of tree components of a given (bounded) size is 
asymptotically normal. To establish Theorem 1, we extend their techniques in two ways. 

- Instead of dealing with the number of vertices in trees of a given size, we apply Stein's method to the 
total number n — J\f(Hd(n,p)) of vertices outside of the giant component; this essentially means that 
we need to sum over all possible (hyper)tree sizes up to about In n. 
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- Since we are dealing with hypergraphs rather than graphs, we are facing a somewhat more complex 
situation than [3], because the fact that an edge may involve an arbitrary number d of vertices yields 
additional dependencies. 

The main contribution of this paper is the proof of Theorem 2. To establish this result, we think of the 
edges of Hd(n,p) as being added in two "portions". More precisely, we first include each possible edge 
with probability pi = (1 — e)p independently, where e > is small but independent of n (and denote the 
resulting random hypergraph by Hi); by Theorem 1, the order JV(Hi) of the largest component of Hi is 
asymptotically normal. Then, we add each possible edge that is not present in Hi with a small probability 
P2 ~ sp and investigate closely how these additional random edges attach further vertices to the largest 
component of Hi . Denoting the number of these "attached" vertices by S, we will show that the conditional 
distribution of S given the value of AT (Hi) satisfies a local limit theorem. Since pi andj»2 are chosen such 
that each edge is present with probability p after the second portion of edges has been added, this yields 
the desired result on Af(Hd(n,p)). 

The analysis of the conditional distribution of S involves proving that S is asymptotically normal. To 
show this, we employ Stein's method once more. In addition, in order to show that S satisfies a local limit 
theorem, we prove that the number of isolated vertices of Hi that get attached to the largest component 
of Hi by the second portion of random edges is binomially distributed. Since the binomial distribution 
satisfies a local limit theorem, we thus obtain a local limit theorem for S. 

Our proof of Theorem 2 makes use of some results on the component structure of Hd(n,p) derived 
in [7]. For instance, we employ the results on the expectation and the variance of JV(Hd(n,p)) from that 
paper. Furthermore, the analysis of S given in the present work is a considerable extension of the argument 
used in [7], which by itself would just yield the probability that S attains a specific value s up to a constant 
factor. 

The main part of the paper is organized as follows. After making some preliminaries in Section 2, we 
outline the proof of Theorem 2 in Section 3. In that section we explain in detail how Hd(n,p) is generated in 
two "portions". Then, in Section 4 we analyze the random variable S, assuming the central limit theorem 
for S. Further, Section 5 deals with the proof of Theorem 1 and the proof of the central limit theorem 
for S via Stein's method; the reason why we defer the proof of Theorem 1 to Section 5 is that we can 
use basically the same argument to prove the asymptotic normality of both JV(Hd(n,p)) and S. Finally, 
Section 6 contains some concluding remarks, e.g., on the use of the present results to derive further limit 
theorems and to solve enumerative problems. 

2 Preliminaries 

Throughout the paper, we let V = {1, . . . , n}. If d > 2 is an integer and Vi, . . . ,Vk C V, then we let 
£d(Vi, . . . , Vk) signify the set of all subsets e C V of cardinality d such that e fl Vi ^ for all i. We omit 
the subscript d if it is clear from the context. 

If H is a hypergraph, then we let V(H) denote its vertex set and E(H) its edge set. We say that a set 
S C V(H) is reachable from T C V(H) if each vertex s G S is reachable from some vertex t G T. 
Further, if V(H) C V — {1, . . . , n}, then the subsets of V can be ordered lexicographically; hence, we 
can define the largest component of H to be the lexicographically first component of order Af(H). 

We use the O-notation to express asymptotic estimates as n — > oo. Furthermore, if f(xi, . . . , Xk, n) 
is a function that depends not only on n but also on some further parameters Xj from domains Di C R 
(1 < i < k), and if g(n) > is another function, then we say that the estimate f(xi, . . . , Xk, n) = 0(g(n)) 
holds uniformly in xi, . . . ,Xk if the following is true: if X, and Dj, X, C Dj, are compact sets, then there 
exist numbers C = C(li, . . . ,1k) and no = no(Zi, ■ ■ ■ ,1k) such that \f(xi, . . . ,Xk,n)\ < Cg(n) for all 
n > no and (xi, . . . , Xk) G IIj=i -^j- ^ e define uniformity analogously for the other Landau symbols fl, 



We shall make repeated use of the following Chernoff bound on the tails of a binomially distributed 
variable X = B'm(i>, q) (cf. [1 1, p. 26] for a proof): for any t > we have 



0, etc. 




(7) 
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Moreover, we employ the following local limit theorem for the binomial distribution (cf. [6, Chapter 1]). 

Proposition 3. Suppose that < p — p(n) < 1 is a sequence such that np{l — p) — > oo as n — > oo. Let 
X = Bin(n,j>). Then for any sequence x = xin) of integers such that \x — np\ = o(np(l — p)) 2 / 3 , 

i / (x — np) 2 \ 
P [X = x\ ~ (27rnp(l — p)) 2 cxp — — 1 as n ^ oo. 



2p(l — p)n 

Furthermore, we make use of the following theorem, which summarizes results from [7, Section 6] on 
the component structure of Hd{n,p). 

Theorem 4. Let p = c(^)" 1 . 

1. If there is a fixed cq < (d — such that c = c(n) < cq, then 

P [Af(H d (n,p)) < 3(d - 1) 2 (1 - (d - l)c )" 2 Inn] > 1 - n~ 100 . 

2. Suppose that cq > (d — l)" 1 is a constant, and that cq < c = c(n) = o(lnn) fli n -> oo. 77ie« f/te 
transcendental equation (2) has a unique solution < p = p(c) < 1, which satisfies 



pn 
d-1 



p<c' < (d-1)" 1 . (8) 



/or some number Cq > f/iaf depends only on cq. Moreover, 

\&[N{H d {n,p))]-{l-p)n\<n ^, 

p[l-p + c{d-l){p-p d - 1 )\ 



V&Y(Af(H d (n,p))) 



(l-c(d- l)/- 1 ) 2 

Furthermore, with probability > 1— n f/iere is precisely one component of order (l+o(l) ) (1 — p)n 
in Hd(n, p), while all other components have order < In 2 n. /« addition, 

P - E(AA(ff d (n,p)))| > n a51 ] < ti" 100 . 

Finally, the following result on the component structure of Hd(n,p) with average degree (jZi)p < 
(d — l) -1 below the threshold has been derived in [7, Section 6] via the theory of branching processes. 

Propositions. There exists a function q : (0, (d— l) _1 )x[0, 1] — ► R>o, ((,£) >-> <?(0£) = J2kLi %(C)£ fc 
whose coefficients C ^ 9fe (C) are differentiate such that the following holds. Suppose that < p = 
p(n) < 1 is a sequence such that < (jZi)p = c = c ( n ) < (d — 1) 1 — e for an arbitrarily small e > 
f/za? remains fixed as n — > oo. Le? P(c, fc) denote the probability that in Hd{n,p) some fixed vertex v V 
lies in a component of order k. Then 

P{c,k) = (1 + o(n" 2 / 3 ))g fe (c) for all l<k< In 2 n. 

Furthermore, for any fixed e > f/zere z's a number < 7 = 7(e) < 1 smc/z f/zaf 

<7fe(c)<7 fe /or a/7 < c < (d - l)" 1 - e. (9) 



3 Proof of Theorem 2 

Throughout this section, we assume that c = c(n) = (jZi)p £ »7 / or some compact interval J C 
((d — 1) j 00). Moreover, we letTd R oe some fixed compact interval, and v denotes an integer such 
that [y — (1 — /a € X. A// asymptotics are understood to hold uniformly in c and (y — (1 — /a. 
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3.1 Outline 

Let e = e(J) > be independent of n and small enough so that (1 — e)("jT*)p > (d — + e. Set 
Pi = (1 — e)p. Moreover, let pi be the solution to the equation p\ + p2 — PiPi = p\ then p2 ~ ep. We 
expose the edges of Hd(n,p) in four "rounds" as follows. 

Rl. As a first step, we let Hi be a random hypergraph obtained by including each of the (™) possible edges 
with probability p\ independently. Let G denote the largest component of Hi . 

R2. Let H2 be the hypergraph obtained from Hi by adding each edge e ^ Hi that lies completely outside 
of G (i.e., e C V \ G) with probability P2 independently. 

R3. Obtain H 3 by adding each possible edge e ^ Hi that contains vertices of both G and V \ G with 
probability P2 independently. 

R4. Finally, include each possible edge e ^ Hi such that ecG with probability P2 independently. 

Here the 1st round corresponds to the first portion of edges mentioned in Section 1, and the edges added in 
the 2nd-4th round correspond to the second portion. Note that for each possible edge e C V the probability 
that e is actually present in H± is pi + (1 — Pi)p2 = P, hence H4 = Hd(n,p). Moreover, as > 
(d — + e by our choice of e, Theorem 4 entails that w.h.p. Hi has exactly one largest component of 
linear size Q(n) (the "giant component"). Further, the edges added in the 4th round do not affect the order 
of the largest component, i.e., ^/(H^) = JV(H 3 ). 

In order to analyze the distribution of N(Hd (n, p) ) , we first establish central limit theorems for M(Hi) = 
\G\ and JV(H^) = A[(H±) = JV(Hd(n,p)), i.e., we prove that (centralized and normalized versions 
of) M(Hi) and M(H 3 ) are asymptotically normal. Then, we investigate the number of vertices S = 
Af(H3) — J\f(Hi) that get attached to Gi during the 3rd round. We shall prove that given that \G\ = n\, S 
is locally normal with mean us + {ni — A*i)As and variance a s independent of m. Finally, we combine 
these results to obtain the local limit theorem for Af(Hd(n, p)) = Af(Hs) = N(Hi ) + S. 

Let ci = (jZi)pi an d c 3 = (d-i).P- Moreover, let < P3 < pi < 1 signify the solutions to the 
transcendental equations pj = exp \cj(p^ 1 — 1)] and set for j = 1,3 

n \ 2 Pi [l-Pj+Cj^-lXft-P-- 1 )]" , , _ 
Pi = (1 — pj)n, a, = j z — (cf. Theorem 4). 

The following proposition, which we will prove in Section 5, establishes a central limit theorem for both 
JV(Hi) and Af(Hs) and thus proves Theorem 1. 

Proposition 6. (Af(Hj) — fJ*j)/o~j converges in distribution to the standard normal distribution for j = 
1,3. 

With respect to the distribution of S, we will establish the following local limit theorem in Section 4. 
Proposition 7. Suppose that \m — pi \ < n - 6 . 

1. The conditional expectation ofS given that \G\ = n\ satisfies E(<S|A/i = ^1) = Ms + As ("1 — A*i) + 
o(y / n), where ps = 0(n) and Xg = 0(1) are independent of 'n%. 

2. There is a constant C > such that for all s satisfying \fig + Xs(ni — pi) — s\ < n 0,6 we have 
P[5 = i/|M = nx} < Cn-i. 

3. If s is an integer such that \p$ + Xs( n i — Ml) ~ s | < 0(^/n), then 

pre u/ - 1 1 ( (Ms + A s (ni -p, x )-s)' 

r [6 = s\Ali = ni\ ~ — — exp ' 



2nas 1 V ^ a s 
where erg = 0(yn) is independent of rii. 
Since A3 = A/i + 5, Propositions 6 and 7 yield 

M3 =Mi +Ms +o( v / n). (10) 

Combining Propositions 6 and 7, we derive the following formula for P [A3 = v] in Section 3.2. Recall 
that we are assuming that v is an integer such that [y — p)/a = (y — p^/cr^ E T. 
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Corollary 8. Letting z = (y — M3)/03> we have 

1 r°° 

P [/V3 =V ]„ / exp 

Proof of Theorem 2. Integrating the right hand side of (11), we obtain an expression of the form 

p [* = "l~^«"(-^). » 2 > 

where n, r 2 = 0(n). Therefore, on the one hand (A3 — ^z)/cr^ converges in distribution to the normal dis- 
tribution with mean k — ^3 and variance (7-/(73 ) 2 . On the other hand, Proposition 6 states that (A3 — M3 ) /03 
converges to the standard normal distribution. Consequently, |« — yu 3 | = o(t) and r ~ a 3 . Plugging these 
estimates into (12), we obtain P [A3 = v\ ~ -y= — exp (— §(i> — M3) 2cr 3~ 2 )- Since A3 = J\f(H<i(n,p)), 
this yields the assertion. □ 



■75- -9 (a;-(l + A5)— — 
2 z V as crs 



dx. 



(11) 



3.2 Proof of Corollary 8 

Let a > be arbitrarily small but fixed as n — > 00, and let C" = C" (a) > be a large enough number 
depending only on a. Set J = {ni G 7Z : \n\ — < C'*Jn}, let J' = {n\ G .ZJ : C'^fri < \n\ — Mi| < 
n 6 }, and J" = {m G ^ : |m - m\ > n 6 }. Then letting 

&x= p Wi = n i] P [S = f - m |M = m] . for x e {J, J', J"} 

we have P [A3 = v\ = \Pj + &ji + and we shall estimate each of the three summands individually. 
Since Theorem 4 implies that P [|Ai — /ill > n 51 ] < n~ 100 , we conclude that 

W j„ < P [Ai e J"] < n- 100 . (13) 

Furthermore, as a\ = 0(n), Chebyshev's inequality implies that 

P [M e J'] < P [|Ai - Mil > C'y/n\ < afC'- 2 ^ 1 < a/C, (14) 

provided that C is large enough. Hence, combining (14) with the second part of Proposition 7, we obtain 

9j> < P [M G J'] • 4= < 7TT7= < «" _1/2 ' ( 15 > 

where once we need to pick C sufficiently large. 

To estimate the contribution of n\ G J, we split J into subintervals Ji, . . . , Jr- of length between ^=77 
and Moreover, let Ij be the interval [(min Jj — H\)/a\, (max J, — mi)/* 7 !.]- Then Proposition 6 implies 
that 

/ exp(-a; 2 /2)<£r < V P [Ai = Hi] < -^3 [ cxp(-x 2 /2)dx (16) 

for each 1 < j < K. Furthermore, Proposition 7 yields 

pre I Kf 1 1 ^ -Ms - A s (ni 

P [6 = v — rai|Ai = n\\ ~ — exp 1 



27TCT5 V 2 ^5 

for each ni G J. Hence, choosing C" sufficiently large, we can achieve that for all n\ G Jj and all x G Ij 
the bound 

(1 + a) 2 / [v - mi - cria; - pis - As(ni - Mi)) 2 



P [5 = f — ni|A/i = ni] < — -== — exp 



2no~s 1 V 2cr?j 

(10) (1 + a) 2 / If <7i (T 3 \ 2 \ 

■exp -- (x-(l + A 5 ) z (17) 



77,75 \ 2 \ as as 
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holds. Now, combining (16) and (17), we conclude that 

K 

*j = H P EM =n 1 ]P[S = u- m|M = m] 



< 



\3 

27TCT 5 



(l + a) a 



S / exp 



— V - 9 I ' (1 + A S)— - ^ ' — 

2 2 \ as as 



dx 



„ 1 + 4a f 00 

~ 27TCTC ./_ _ 6XP 



ar 1 / , cr 3 

—FT - g (X- (1 + A 5 ) 2- 



d.T. 



(18) 



Analogously, we derive the matching lower bound 



l-4a Z" 00 
^.7 > -z / exp 



2 2 \ as as J 



dx. 



(19) 



Finally, combining (13), (15), (18), and (19), and remembering that P [A3 = u] = Wj + &j> + if - ,/", we 
obtain the assertion, because a > can be chosen arbitrarily small if n gets sufficiently large. 

4 The Conditional Distribution of S 

Throughout this section, we keep the notation and the assumptions from Section 3. In addition, we let 
G C V be a set of cardinality nx such that |m — < n 06 . 



4.1 Outline 

The goal of this section is to prove Proposition 7. Let us condition on the event that the largest component 
of Hi is G. To analyze the conditional distribution of S, we need to overcome the problem that in Hi 
the edges in the set V \ G do not occur independently anymore once we condition on G being the largest 
component of Hi. However, we will see that this conditioning is "not very strong". To this end, we shall 
compare S with an "artificial" random variable Sg, which models the edges contained in V \ G as mutually 
independent objects. To define Sq, we set up random hypergraphs Hj g, j = 1, 2, 3, in three "rounds" as 
follows. 

Rl\ The vertex set of Hi q is V = {1, . . . , n}, and each of the ( n ^ n ) possible edges e C V \ G is 

present in Hi q with probability p\ independently. 
R2'. Adding each possible edge ec7\G not present in Hi.g with probability P2 independently yields 

H2,G- 

R3'. Obtain i?3,G from i?2,G by including each possible edge e incident to both G and V \ G with proba- 
bility P2 independently. 

The process RT-R3' relates to the process R1-R4 from Section 3.1 as follows. While in Hi the edges 
in V \ G are mutually dependent, we have "artificially" constructed Hi g in such a way that the edges 
outside of G occur independently. Then, #2,G an d #3,G are obtained similarly as H2 and H3, namely by 
including further edges inside of V \ G and crossing edges between G and V \ G with probability p2- 
Letting Sg denote the set of vertices in V \ G that are reachable from G, the quantity Sg = \Sg\ now 
corresponds to S. In contrast to R1-R4, the process RT-R3' completely disregards edges inside of G, 
because these do not affect Sg- The following lemma, which we will prove in Section 4.3 shows that Sg 
is indeed a very good approximation of S, so that it suffices to study Sg- 



Lemma 9. For any v £ 2Z we have |P [S 



Af(Hi 



ni\ 



P[S G 



v\\ < n 



As a next step, we investigate the expectation of Sg- While there is no need to compute E(<Sg) precisely, 
we do need that E(<Sg) depends on ni — p,\ linearly. The corresponding proof can be found in Section 4.4. 
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Lemma 10. We have E(<S G ) = u$ + \s{ n i ~ Mi) + oi^/ri), where p$ = 0(n) and As = 0(1) do not 
depend on n\. 

Furthermore, we need that the variance of S G is essentially independent of the precise value of m. This 
will be proven in Section 4.5. 

Lemma 11. We have Var(<S G ) = 0(n). Moreover, ifG'dV is another set such that \p,\ — \G'\\ = o(n), 
then |Var(S G ) - Var(S G /)| = o(n). 

To show that <S G satisfies a local limit theorem, the crucial step is to prove that for numbers s and t 
such that s is "close" to t the probabilities P [Sg = s], P [Sg = t] are "almost the same". More precisely, 
the following lemma, proven in Section 4.2, holds. 

Lemma 12. For every a > there is fi > such that for all s, t satisfying \s — E(<S G )|, \ t — E(<S G )| < n oe 
and \s — t\ < fin 1 / 2 we have 

(1 - a)P [S G = s] - n- 1Q < P [S G = t] < (1 + a)P [S G = s] + n~ w . 

Moreover, there is a constant C > such that P [Sg = s] < CnT 1 ' 2 for all integers s. 

Letting Go = {1, ... , }, we define <r^ = Var(6>G ) and obtain a lower bound on ag as an immediate 
consequence of Lemma 12. 

Corollary 13. We have as = fi{y/n). 

Proof. By Lemma 12 there exists a number < fi < 0.01 independent of n such that for all integers s, t 
satisfying |s - E(<S G )|, \t — E(Sg)\ < y/n and \s — t\ < fi^fn we have 

P [S G = t}> ^P [S G - a] - n~ 10 . (20) 

Set 7 = /3 2 /64 and assume for contradiction that <r| < 777/2. Moreover, suppose that G = Go = 
{1, . . . , [Mill- Then Chebyshev's inequality entails that P [\Sg — E(£> G )| > y/yn] < \- Hence, there 
exists an integer s such that \s — E(5 G )| < y/yn and P [Sg — s] > . Therefore, due to (20) we 

have P [Sq = t] > j(jn)~i for all integers t such that \s — t[ < fiy/n. Thus, recalling that 7 = fi 2 /64, 
we obtain 1 > P [|<S G — s\ < fi^/n] — Ylt:\t-s\<i3^/n^ > = t] > j^== > 1- This contradiction shows 
that 0% > 771/2. □ 

Using the above estimates of the expectation and the variance of Sg and invoking Stein's method once 
more, in Section 5 we will show the following. 

Lemma 14. If |m — < n 6e , then (Sg — E(<5>g))/cts is asymptotically normal. 

Proof of Proposition 7. The first part of the proposition follows readily from Lemmas 9 and 10. Moreover, 
the second assertion follows from Lemma 12. Furthermore, we shall establish below that 

P[5 G = s] ^=exp(- ^ S ~^f^ } for any integer s such that \s - E(<S G )| = 0(y / n). (21) 

v27r V 2<7 5 / 

This claim implies the third part of the proposition. For (s — E( l S)) 2 cr^ 2 ~ (ps + ^s( n i — Mi)) 2f7 5 2 by 
Lemma 10 and Corollary 13, and P [S — s\Afi = ?7i] ~ P [Sg = s] by Lemma 9. 

To prove (21) let a > be arbitrarily small but fixed. Since a s = 0(n) by Lemma 1 1 and Corollary 13, 
Lemma 12 entails that for a sufficiently small fi > and all s, t satisfying \s — E(<S G )|, \t — E(<S G )| < n 6 
and \s — t\ < fio-g we have 

(1 - o)P [S G = s] - n~ w < P [S G = t] < (1 + a)P [S G = s] + n" 10 . (22) 
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Now, suppose that s is an integer such that \s — E(<S G )| < 0(y/n), and set z = (s — E(<S G ))/<7s. Then 
Lemma 14 implies that 

P [|5 G - s| < Pas] > / exp(-x 2 /2)^ > (1 - 2a)-£= exp(-z 2 /2), (23) 

provided that /3 is small enough. Furthermore, (22) yields that 

P [\S G -s\< Pa s ) = Yl PlSc=t}< (3a s ((l + a)P [<S G = s] + n" 10 ) 

t:\t-s\<0as 

< (1 + a)/3a 5 P [5 G = s] + 7i~ 9 , (24) 
because erg = 0(^/n) by Lemma 11. Combining (23) and (24), we conclude that 

I -2a 1 , 2/0 , l-4a f (s - E(<S G )) 2 



P [5 G = s] > = — exp(-z72) - n -9 > -= — exp 



1 + « V27T(T 5 V27TCT5 V 2a- 



Since analogous arguments yield the matching upper bound P [<S G = s] < ex P (~ ~ 2^°^ )' 

and because a > may be chosen arbitrarily small, we obtain (21). □ 

Next we will prove Lemma 12 which provides the central locality argument while the more technical 
proofs of Lemma 9, 10 and 1 1 are deferred to the end of this section. 



4.2 Proof of Lemma 12 

Since the assertion is symmetric in s and t, it suffices to prove that P [<S G = s] < (1 — a)~~ 1 Y [<S G = s] + 
n~ w . Let T = E(H 3 ^ G ) \ E(H 2 ,g) be the (random) set of edges added during R3'. We split T into three 
subsets: let T\ consist of all e <G T such that either |e \ G\ > 2 ore contains a vertex that belongs to a 
component of V \ G of order > 2. Moreover, JF 2 is the set of all edges e £ T \ T\ that contain a vertex 
of V \ G that is also contained in some other edge e' € T\. Finally, T 3 ~T \ {T\ U T?)\ thus, all edges 
e G JF 3 connect d— 1 vertices in G with a vertex v E V\G that is isolated in i?2,G + ^i + ^2> see Figure 1 
for an example. Hence, H^ q = i/2,G + J~i + -^2 + ^3- 




Fig. 1. The three kinds of edges (black) which attach small components to G. The edges of H2,g are depicted in grey. 
The (3-uniform) edges are depicted as circular arcs spanned by the three vertices contained in the corresponding edge. 



As a next step, we decompose 6> G into two contributions corresponding to T\ U JF 2 and JF 3 . More 
precisely, we let S G 1S be the number of vertices in V \ G that are reachable from G in H2,g + J~i + Ti and 
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set 5g° = S G - S G lg . Hence, if we let W signify the set of all isolated vertices of H 2> g + 3~i + J~i in the 
set V \ G, then Sq° equals the number of vertices in W that get attached to G via the edges in 

We can determine the distribution of <Sg° precisely. For if v £ VV, then each edge e containing v 
and exactly d — 1 vertices of G is present with probability p 2 independently. Therefore, the probability 
that v gets attached to G is 1 — (1 — p 2 )( d -~^ . In fact, these events occur independently for all v £ W. 
Consequently, 



= Bi 



(|W|,1 - (1 -P2) ( " l} ) , Miso = E(5g°) = |W|(1 - (l- P2 )G-i)) = (25) 



where the last equality sign follows from the fact that p2 ~ epi = ©(n 1 d ). 

Hence, 5g = Sq S + Sq° features a contribution that satisfies a local limit theorem, namely the bino- 
mially distributed Sq°. Thus, to establish the locality of Sg (i.e., Lemma 12), we are going to prove that 
Sq "inherits" the locality of Sq°. To this end, we need to bound | W|, thereby estimating fj,- lso = E(<Sg°). 

Lemma 15. We have P > §(n - m) exp(-c)] > 1 - n" 10 . 

The proof of Lemma 15 is just a standard application of Azuma's inequality, cf. Section 4.6. 
Further, let M be the set of all triples (H, Fi, F 2 ) such that 

Ml. P [S G = s\H 2 , G = H,Tx=F x ,F 2 = F 2 ] > n" 11 , and 

M2. given that H 2 ,g = H, T\ = F u and T 2 = F 2 , the set W has size > |(n - ni) exp(-c) = J?(n). 

Lemma 16. /fls— 1\ < (3y/n for some small enough (3 = f3{a) > 0, thenP [Sg = i|(-ff2,G, (Fx-, (F-i) G M] > 
(1 - a)P [<S G = sK^a.G, ^1,^2) € M]. 

Prao/ Let (F, F U F 2 ) £ M, and let 6 be the value of S G ig given that H 2 , G = H,(Fx = F\ and (F 2 = F 2 . 
Then given that this event occurs, we have Sg = s iff Sq° = s — b. As (H, Fx , F 2 ) £ M, we conclude that 



P [S G = s\H 2 . G =H,F 1 =F 1 ,F 2 = F 2 ] = P [Bin f \W\, 1 - (1 - p 2 )^ 



Ml 

>n -11 . 



Therefore, the Chernoff bound (7) implies that \s — b — /i; so | < n 0,6 . Furthermore, since we assume that 

\t-a\< (in 1 / 2 for some small (3 = (3(a) > and as /x iso = |W|(1 - (1 - p 2 )(*~J) > ^(n) due to M2, 
Proposition 3 entails that 

Bin (\W\,1 - (1 - p 2 )(«-J) =t-b] > (l-a)P [Bin (|W|, 1 - (1 - p 2 )(*-J) =s-b 



Thus, the assertion follows from (25). □ 
Proof of Lemma 12. By Lemmas 15 and 16, we have 

P [S G = s] < P [S G = s\(H 2iG , Fx, F 2 ) £ M] P [{H 2 . G , Fx,F 2 ) £ M) + (1 - a)' 1 ? [S G = t] 

M1,M2 

< n- 11 + P [|W| = o(n)] + (1 - a)" 1 ? [<S G = i] < (1 - a)" 1 ? [5 G = t] + 
as claimed. □ 



4.3 Proof of Lemma 9 

Let £g signify the event that G is the largest component of H\, Given that Cg occurs, the edges in if 3 — G 
do not occur independently anymore. For if Cg occurs, then Hi — G does not contain a component on 
more than \G\ vertices. Nonetheless, the following lemma shows that if E C £{V) \ £{G) is a set of 
edges such that the hypergraph H(E) = (V, E n £(V \ G)) does not feature a "big" component, then the 
dependence of the edges is very small. In other words, the probability that the edges E are present in if 3 
is very close to the probability that these edges are present in the "artificial" model H^.g^ in which edges 
occur independently. 
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Lemma 17. For any set E C £{V) \ £(G) such thatM(H(E)) < In 2 n we have 

P [E(H 3 ) \ £(G) = E | C G ] = (1 + O(n- 10 ))P [E(H 3iG ) = E] . 

Before getting down to the proof of Lemma 17, we first show how it implies Lemma 9. As a first step, we 
derive that it is actually quite unlikely that either H 3 — G or H 3G — G features a component on > In 2 n 
vertices. 

Corollary 18. We have P [Af(H 3 - G) > In 2 n\C G ] , P [Af(H 3tG - G) > In 2 n] = 0(n~ w ). 

Proof. Theorem 4 implies that P [Af(H 3 _ G — G) > In 2 n] = 0(n~ 10 ), because H 3tG simply is a random 
hypergraph H d (n - ni,p), and ( n ^)p — { n d-i)P < ( d ~ ^O" 1 b y ( 8 )- Hence, Lemma 17 yields that 

P [Af(H 3 - G) < In 2 n\C G ] > (1 - O(n- 10 ))P [^(ffa.G - G) < In 2 n] > 1 - 0( n - 10 ). □ 

Proof of Lemma 9. Let denote the set of all subsets E C £ (V) \ £ (G) such that in the hypergraph 
(V, E) exactly s vertices in V \ G are reachable from G. Moreover, let B s signify the set of all E € A s 
such that J\f(H (E)) < In 2 n. Then 

P [S = s\L G ] = P [E(H 3 ) \ £{G) e As\C G ] , and P [S G = s] = P [^(F 3 , G ) G A] . (26) 

Furthermore, by Corollary 18 

P [E(H 3 ) \ £(G) e A \ B S \C G ] < P [7V(F 3 - G) > In 2 7i|£ G ] = 0(n~ w ), (27) 
P [^(^3,g) eA\B,]<P [M{H 3 , G -G)> In 2 n] = 0(n^ 10 ). (28) 

Combining (26), (27), and (28), we conclude that 

P{S = s\£ G ] = P[E(H 3 )\£(G)eB s \C G }+0(n- w ) 

Lemma 17 p £ + = p ^ = g] + O^" 10 ), 

thereby completing the proof. □ 

Thus, the remaining task is to prove Lemma 17. To this end, let H.i(E) denote the event that £(V \ 
G) n E{Hi) = E. Moreover, let H 2 {E) signify the event that £{V \ G) n E(H 2 ) \ E{H{) = E (i.e., E is 
the set of edges added during R2). Further, let H 3 {E) be the event that £{G,V \G) n E{H 3 ) = E (i.e., E 
consists of all edges added by R3). In addition, define events TLi :G (E), Ti.2, G {E), Tt 3tG (E) analogously, 
with Hi, H 2 , H 3 replaced by Hi G , H 2 . G , H 3 G . Finally, let C G denote the event that G is a component of 
Hi. In order to prove Lemma 17, we establish the following. 

Lemma 19. Let E x C £(V \ G), E 2 C £{V \G)\ E h and E 3 C £(G, V \ G). Moreover, suppose that 
M(H(Ei)) < In 2 n. Then P [a' = i n i {E % )\C G \ = (1 + O(n^ 10 ))P [A- = i n hG (Ei 

Proof. Clearly, 

P [H 2 (E 2 ) A H 3 {E 3 )\C G A Hi{Ei)] P [HtiEl) A C G ] 



^HiiE^lC 



G 



P[C C 



(29) 



Furthermore, since R2 and R3 add edges independently of the 1st round with probability p 2 , and because 
the same happens during R2' and R3', we have 

P [H 2 (E 2 ) A H 3 (E 3 )\C G A Hi{Ei)] = P [H 2<G (E 2 ) A W 3 ,g(^ 3 )|Wi,g(^i)] ■ (30) 

Moreover, given that Tti(Ei) occurs, Hi — G has no component on more than In 2 n vertices. Hence, 
G is the largest component of Hi iff G is a component; that is, given that Tii(Ei) occurs, the events 
C G and C G are equivalent. Therefore, P [C G A Hi(Ei)} ~ P [C G A Wi(-Ei)]. Further, whether or not G 
is a component of Hi is independent of the edges contained in V \ G, and thus P [C G A Hi (Ei )] = 
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P [Cq] P [Hi (Ei)). Hence, as each edge in Ei is present in Hi as well as in Hi G with probability pi 
independently, we obtain 



P [C G A Hi(Ei)) = P [C G )p[ Ell (l-pi) £(v \ G) - ]El1 = P [C G ] P [Hi. G (Ei) 
Combining (29), (30), and (31), we obtain 

_ P[Cg) 



/\Hi(Ei)\Co 



P[C C 



■ P 



f\n i>G (Ei) 



(31) 



(32) 



Since by Theorem 4 with probability > 1 — n 10 the random hypergraph Hi = H^n^i) has precisely 
one component of order J?(n), we get prg^j = 1 + 0(n~ 10 ). Hence, (32) implies the assertion. □ 

Proof of Lemma 17. For any set E C £(V) \ £(G) let T(E) denote the set of all decompositions 
(E U E 2 ,E 3 ) of E into three disjoint sets such that Ei,E 2 C £(V \ G) and E 3 C £(G, V \ G). If 
N(H(e)) < In 2 n, then Lemma 19 implies that 



P[E(H 3 )\£(G) = E\£ G ) = P 

(E 1 ,E 2 .E 3 )eJ r (E) 
3 



/\Hi{Ei)\CG 



(l+0(n- w )) ]T P 

(Si,E 2 ,B 3 )e^"(-E) 



= (l + O(7i- 10 ))P [E(H 3 , G ) = E) 



as claimed. 



□ 



4.4 Proof of Lemma 10 

Recall that S G signifies the set of all vertices v <G V \ G that are reachable from G in H 3 G , so that 
Sg = \S G \. Letting C v denote the component of H 2G that contains v E V, we have 



E(S a ) = P e S G ] = E p [* e ^H^l - fc] P [|C| = k) 
vev\c vev\G fc=i 



(33) 



Since i?2,G is just a random hypergraph i?d(n — n i>£>)> an d because ~ ('d— I 1 )-? < — 1) 1 

by (8), Theorem 4 entails that Af(H 2 , G ) < In 2 71 with probability > 1 — n~ 10 . Therefore, (33) yields 



E(5 G ) = o(i)+ Yl E P[veS ||C| = fc]P[|C| = fc] 

i'GV'\G l<Ai<ln 2 n 



(34) 



To estimate P [v £ 5g||C«| = &]> let z = z(rii) = (m — Hi)/<Ji, £o = ex P 



-P2 



17-1) 



/"-Mil 
V d-l I 



and 



Additionally, let C,(z) 



( d-l )P 



Lemma 20. For allKk < lri 



■ have P[v € S G \ \C V \ = k) = 1 - £(;z) fe + O^ 1 • polylogn). 



Proof. Suppose that \C V \ = k but v £ S G . This is the case iff in H 3 G there occurs no edge that is incident 
to both G and C v . Letting £(G,C(v)) denote the set of all possible edges connecting G and C v , we shall 
prove below that 



\£(G,C V ) 



n 
d-l 



n - mi 
d-l 



1 



zoi ( n — fii 
d-2 



0(n ■ poly log n) 



0(n d ~ 1 ■ polylog^) 
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By construction every edge in £(G, C v ) occurs in H$ g with probability p2 independently. Therefore, 

P [v S G \\C V \ = k] = (1 - P2 f( G ^\ = (1 + 0{n- 1 ■ polylogn)) exp [-p 2 \S (G,C V )\] 
(l + 0(n- 1 -polylogn))^(^) fe , 

hence the assertion follows. 

Thus, the remaining task is to prove (35). As a first step, we show that 

^'i-C)-("^)-(v) + (""™ _t > 

For there are (™) possible edges in total, among which ( n ~^ k ) contain no vertex of C v , contain 
no vertex of G, and ( n ~ n J~ ) contain neither a vertex of C v nor of G; thus, (36) follows from the in- 
clusion/exclusion formula. Furthermore, as k = O(polylogn), we have — ( n ~ d k ) = (1 + 0(n _1 • 
polylognJJfc^J and { n ~^) - ( n "7- fe ) = (1 + 0(n" x • pol y logn))fc(" d l" 1 1 ). Thus (36) yields 



\£{G,C{v))\ = (l + 0(n- 1 ■polylogn))fc 



n \ I n — ni 
d- 17 ~ { d- 1 



(37) 



As m = fi t + za x , we have ("I" 1 ) = ( n d "l^) - za^'^) + 0(n d - 2 • polylogn), so that (35) follows 
from (37). □ 

Let <?(e,£) = SfcLi ?fc(C)s be the function from Proposition 5. Combining (34) with Proposition 5 
and Lemma 20, we conclude that 

E(S G ) = o(n^ 2 ) + q((n - m)p, £(z))(n - m) = o^ 1 / 2 ) + q(C(z), £(*))(n - m). (38) 

Since g is differentiable (cf. Proposition 5), we let Z\ c = §£(C(0), f(0)) and Z\ 5 = §§(C(0),£(0)). As 
C(z) - C(0), fCe) - C(0) = 0(n- 1 /2), we get 

<z(C(z), - ?(C(0), f(0)) = (C(z) - C(0)) A; + m)A + oin- 1 ' 2 ) 

= za x (" d ~ a') Ko^Pa - 4 c p] + °(n- 1/2 ). (39 ) 

Finally, let /i S = (n - ^iMC(O), £(0)) and A 5 = «(C(0),£(0)) - (d - 1) [efc^ - 4 C ] (Vi)?- Then 
combining (38) and (39), we see that E(S G ) = Ms + zcriAs + o(y / n), as desired. 



4.5 Proof of Lemma 1 1 

Remember that S G denotes the set of all "attached" vertices, and N v g the order of the component of 
v G V \ G in the graph Hi,g- 

The following lemma provides an asymptotic formula for Var(6>c). 

Lemma 21. Let r G j = P [N v ,g = iA»£ «Sg] and fa,% = P [A^g = i A v ^ Sg] /or any vertex v G 
V \ G. Moreover, set r G = J2i=i r G,i, Rg = Y^f=i ir G,i, Rg = J2i=i if G,% for L = [ln 2 n]. In 
addition, let a G = 1 — |G|/?i and 

R 2 1 — a d ~ 2 
r G = (l-R G )(R G ~r G ) + ((d-l)c-l)^+R G + {d-l)(l-a d G 2 )ecR 2 G + j^Rg- (40) 

?"G 1 — CXq 

Then \ai{S G ) ~ a G r G n + a G r G (l — r G )n. 
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Before we get down to the proof of Lemma 21, we observe that it implies Lemma 11. 

Proof of Lemma 11. By Theorem 4 part 2 together with Lemma 9 we know that with probability at least 
1 — ?i~ 8 there are no components of order > In 2 n inside of V \ G. Let <?(£,£) = X)fc=i Qk(C)£. k be the 
function from Proposition 5, and let £(z) be as in Lemma 20. Then Proposition 5 and Lemma 20 entail that 
for all v G V \ G 

r G ,i = 9< {("d-f)p) ^(' G l - ^' a ^ f ^ ~ * ((V- 1 (1 " C((|G| ~ /il)/fTl)) - 

By (9) there exists a number < 7 < lsuchthat^ (("^Jf )p) < 7*. Since < £((|G|-yUi)/(Ti) < 1, 
this yields rG », fG.i _• 7 l - Hence, R G , R g = 0(1), so that Lemma 21 implies Var(5c) = 0(n). 

Finally,ifG" C V satisfies ||G'HG|| < »°- 9 ,then|( n ^f)p-( n ^'l)p| = 0{\G\-\G'\)/n, because 

p = 0(?i 1 - d ). Therefore, |% ((™ d "Jf)p) - ft (('^if ')pj I = 0(|G| - \G'\)/n, because the function 

C l—> 9i(C) i s differentiable. Similarly, as £(z) = £o(l + ZCT1P2 ("dl^ 1 )) ^ or some fixed £0 = <9(1), we have 
|f((|G|-/ii)/<7i)-e((|G'|-A*i)/£ri)| = 0(|G|-|G / |)/n. Consequently, |r Gl< -r G ,,<| = 0(\G\-\G'\)/n 
and \f G ,i-f G ',i\ = 0(\G\ - |G'|)/n, and thus 

Vg - r G >\, \Rg ~ Rg>\, \Rg -Rg>\ = 0(\G\ - \G'\)/n = 0( n - 01 ). 

Hence, Lemma 21 implies that |Var(6>G) — Var(6>c)l = o(n). □ 
The remaining task is to establish Lemma 21. Keeping G fixed, in the sequel we constantly omit the 
subscript G in order to ease up the notation; thus, we write a instead of a G etc. As a first step, we compute 
P{v,w e S) - r 2 . Setting 

L 

51 = P l N *> = J A w e S|w ^ C V ,N V = i,veS]-P [N w = j A w e S]] 

xP [w £ G^ = i,u G S] P [iV„ = i A i> G S] , 

L 

5 2 = (1 - r) ^P[weC„ |JV„ = yeS]P [JV„ = i A v G S] , 

i=l 

we have P(u, w G S) - r 2 = Si + S 2 . 

To compute S2, observe that whether w G C v depends only on A^, but not on the event v G S. 
Therefore, P [10 G G w | A„ = i , 1; G S] = P [to G C v \ N v = i] = (^ 2 2 ) (""^ _1 = , because given that 

iVt, = i, there are (™°~i ) ways to choose the set G„ C V \ G, while there are (™°_^ 2 ) ways to choose G^ 
in such a way that u; G C v . As a consequence, 



1 - L 1 _ 

5 2 _ i)p [jv„ = i A v G SI = -(i? - r). 

ra — 1 n — 1 

i=i 

With respect to Si, we let 

P x (i, j) = P [JV» = i|to C V ,N V = i] , 

P 2 (i, j) =P[« S| = j, w # C V ,N V =i,v€S], 

so that 

Si=J2 iMhiWhl) - P [N m = j A w G S]] P [u> C„|A„ = i, « G S] P [A„ = * A v G S] 
~ ^ [A (», j) - P [JV» = i] P [w G S|A^ = j]} P[N v =iAveS]. 

i,3 
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Lemma 22. We have Pi(i,j)P [N w = j}^ 1 = 1 + {{d ~^T n ^ — + 0{n- 2 ■ polylogn). 

Proof. This argument is similar to the one used in the proof of Lemma 41 in [7]. Remember that if we 
restrict our view on i?3,G to the set V \ G the hypergraph is similar to a Hd (n — n\ , p). In order to estimate 
Si, we observe that 



P [N w = j in H d (n - n x ,p) \ N v = i, w & C v ] = P [N w = j in H d (n - m,p) \ C v ] . 



(41) 



Given that N v = i, Hd(n,p) \ C v is distributed as a random hypergraph Hd(n — rii — i,p). Hence, the 
probability that N w = j in Hd(n,p) \ C v equals the probability that a given vertex of Hd(n — ni — i,p) 
belongs to a component of order j. Therefore, we can compare P [N w = j in Hd(n — m,p)\ C v ] and 
P [N w = j in H d (n — ni , p)] as follows: in H d (n — ni — i,p) there are ( n ~" 1 Si~ ) ways to choose the set 



C w \ {j}. Moreover, there are (" ™ J l ) 



' J ) — id) possible edges connecting the chosen set C w 
with V \ C w , and as C w is a component, none of these edges is present. Since each such edge is present 
with probability p independently, the probability that there is no C w -V \ C w edge equals 



(l-pjC""- 1 "*)-^"" 1 -" 4 "')-^). 

By comparison, in H d (n — ni,p) there are (""•"'j - ) ways to choose the vertex set of C w . Further, there 
— (^) possible edges connecting C w and V \ C w , each of which is present with 



are 



n-nt-j 



probability p independently. Thus, letting 7= ( n 7" < )-( n "V" j )-[(' , 7 1 ) - (""7^)] , we obtain 



P [N w = j in H d (n - m,p) \ C v ] 



n — m — 1 
3-1 



P [N w = j in H d (n - ni,p)} 
Concerning the quotient of the binomial coefficients, we have 

-1 



n — ni — 1 

i-i 



(i-pf 



n — rii — i — ±\ f n — rii — 1 



exp 



i(j - 1) 



Moreover, 7 = ( n ~™ 1 ) 



{n-n 1 ~i) d + (n-n 1 -j) d -(n-n 1 -i-j) d 



n — ni 
- 1 



+ 0(n 2 ■ polylogn) 



(42) 



(43) 



n — ri\ 
d 



(n — ni) 2 



+ 0(n 3 • polylogn) 



Expanding the falling factorials, we get 

ij + 0{n d ~ 3 ■ polylogn). 



n — ni 
d-2 



(44) 



Plugging (43) and (44) into (42), we obtain 



P[N w =jmH d (n-n u p)\C v ] 
P [N w = j in H d (n - m,p)] 



exp 



exp 



n — 111 
i{j - 1) 



0(n 2 ■ polylogn) 



(1-P) 



-Cd-2 1 ) i 3+°( nd '''Polylogn) 



n — ni 
d-2 



ijp + 0(n 2 ■ polylogn) 



n — ni 

1 + (n - m) _1 [((d - l)c - l)ij + 0(n~ 2 ■ polylogn) 



Therefore, by (41) 

P [N w = j\N v =i,w<£ C v ] - P [N w = j in H d (n - m,p)] 

=P [N w = j in H d (n - m,p)] [n~ l [((d - l)c - + + 0(n.- 2 ■ polylog nf}5) 

□ 



Lemma 23. Setting 71 = ppjp^ and 72 = (d - 1)(1 - or 2 )ec, we have P 2 (i,j) 
P[we S[N W = 3} = n^P [10 g S\N W = j] (j'71 - ijj 2 ) + 0(tT 2 ■ polylogn). 
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Proof. Let T be the event that JV^, = j, w $ C v , N v = i, and v G S. Moreover, let Q be the event 
that in H3 there exists an edge incident to the three sets C v , C w , and G simultaneously, so that P-2(i,j) = 
P [Q\F] +P[we S\^Q,F] P hQ|^] . 

To bound P [to G S\->Q, T] — P [to G S^iV^, = j], we condition on the event that C v and C w are fixed 
disjoint sets of sizes i and j. Let Q' signify the probability that C w is reachable from G in H^ q, and let 
Q denote the probability that C w is reachable from G in H^ a^ an d that the event occurs. Then Q' 
corresponds to P [to G S\N W = j] and Q to P [to G S\->Q, J 7 ], so that our aim is to estimate Q — Q'. As 
there are \£{G, C v )\ — \£{G, C v , C w )\ possible edges that join C v and G but avoid C w , each of which is 
present in H^ c with probability P2 independently, we have 

Q = 1- (i- V2 )\£{G,c v )\-\s(G,c M \^ while q' = i_ (i-pj,)^^-)!. 

Therefore, 

Q-Q' = (1 -p^CWI [l - (1 _ P2 )-|£(G,a,c w )| 

~ (1 - Q') (1 - exp [p 2 |£(G, C„, ~ ij(Q' - 1) 



7? 

d-2 



n 
d-2 



P2- 



As (j_i)jJ2 ~ £C, we thus get 



P [to G S|-.Q,.F] - P [to G 5*1^ = j] - ij(P [to G = j] - l)(d - 1)(1 - a d - 2 )ecn 



(46) 



With respect to P [Q|.F], we let /C signify the number of edges joining C v and G. Given that T occurs, 



K. is asymptotically Poisson with mean A^ 



Kd-l) 



(d-l) 



K = k, the probability that one of these fc edges hits C w is 'P(fc) 
V(k) ~ jfc 



P2 ~ t(l — or )ec. Moreover, given that 



n 

d-2 



n 
d-2 



n 

d-l 



Consequently, 



1 - exp(-Ai) ^ fc! 



no 
d-l 



£(C„,G) 
-1 



jfc(d-l) 



and thus 

1 - a d - 2 



I — a 



d-l 



](l-a d ~ 2 ) 



n(l-cxp(-A l ))(l-a d " 1 )' 



(47) 



Combining (46) and (47), we obtain the assertion. 
Thus, 



□ 



lS i ~ H P t 1 ' e 5 A ^ 



j] [7iJ + 72^] 



x J2 i(( d - !) c - +']P[weSAiV„ ( = j] + P [to 5 A iV u 

i=i 

p2 _ W 1 — a d ~ 2 

= ((d - l)c - 1)— + i? + 72 i? 2 + V -^P [JV„ = i] R 

r * — ' 1 — a" 1 

i=l 

= ((d - l)c - 1)— + R + (d - 1)(1 - a d - 2 )ecR 2 + 1 ~ °\ ] r. 

r 1 — a d 1 

Hence, letting r be as defined by (40) we have P [v, w G S] — P [u G 5] P [to G S] ) ~ r/n. Consequently, 

Var(S) ~ a-Tn + a 2 r(l - r)n. 
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4.6 Proof of Lemma 15 

The probability that a vertex v € V \ G is isolated in H 3i g is at least (1 — p)( rf- 1 )(1 — f^)^ -1 ) ~ 
cxp(-p(7J 1 1 ) - ep( d ™ 1 )) > exp(-c). Therefore, 

E(|W|) > (1 - o(l)) exp(-c)(n - m). (48) 

To show that |W| is concentrated about its mean, we employ the following version of Azuma's inequality 
(cf. [11, p. 38]). 

Lemma 24. Let C2 = YiiLi ft% be a product of probability spaces. Moreover, let X : Q — >■ R foe a random 
variable that satisfies the following Lipschitz condition. 

If two tuples u> = {u>i)i<i<K > = {^'i)i<i<K G ft differ only in their j 'th components for 

some 1 < j < K, then \X%j) - X(u>')\ < 1. { ' 

Then P [\X - E(X)| > t] < 2 exp(-^), provided that E(X) exists. 
Using Lemma 24, we shall establish the following. 

Corollary 25. Let Y be a random variable that maps the set of all d-uniform hypergraphs with vertex set 
V to [0, n]. Assume that Y satisfies the following condition. 

LetHbeahypergraph,andlete G £{V). Then \Y(H)-Y(H+e)\, \Y{H)-Y(H-e)\ < 1. (50) 

Then P [\Y(H 3 , G ) - E(Y(H 3 , G ))\ > n 66 ] < exp(-n 01 ). 

Proof. In order to apply Lemma 24, we need to decompose H^ g into a product JX[li ft* °f probability 
spaces. To this end, consider an arbitrary decomposition of the set £(V) of all possible edges into sets 
£i U • • • U£k so that K < n and \Ei{E{H 3t a) Hfj) < n aA for all 1 < j < K; such a decomposition exists, 
because the expected number of edges of H^^g is < (^)p = 0(n). Now, let Q e be a Bernoulli experiment 
with success probability p for each e € £ (V \ G), resp. with success probability p2 for e € £ (G, V \ G). 
Then setting J?; = Ileef we obtain a product decomposition H^^g = Y\f=i ft*- 

In addition, construct for each hypergraph H with vertex set V another hypergraph H* by removing 
from H all edges e e £i such that \E(H) C\£i\ > 4n° 1 (1 < i < K). Since \E(H 3tG ) n fj| is the sum of 
two binomially distributed variables, the Chernoff bound (7) implies that P [\E(H 3 ,g) PI £i] \ > 4n 01 ) < 
exp(— ?i 05 ). As if < n, this entails 

P [H 3 , G ^ Hl G ] < A'exp(-?i 05 ) < exp(-n 04 ), so that (51) 
\E(Y{H 3iG )) - E(Y(Hl G ))\ < 1 [because < Y < n]. (52) 

As a next step, we claim that Y*(H) = ^n~ 01 Y(H*) satisfies the Lipschitz condition (49). For 
by construction modifying (i.e., adding or removing) an arbitrary number of edges belonging to a single 
factor £i can affect at most 4rt 01 edges of H* . Hence, (50) implies that Y*(H) satisfies (49). Therefore, 
Lemma 24 entails that 

P [\Y(Hl G ) - V(Y(Hl G ))\ > n - 63 ] < P [\Y*(H 3 , G ) - E(Y*(H 3<G ))\ > n ' 52 ] < exp(-n - 02 ). 

(53) 

Finally, combining (51), (52), and (53), we conclude that 

P [\Y(H 3tG )-E{Y(H 3tG ))\ > n ' 64 ] < P [\Y*(H) - E(Y*(H))\ > n a63 ] + P [H 3 , G Kg] 

< exp(-n 01 ), 

thereby completing the proof. □ 
Finally, since |W|/d satisfies (50), Lemma 15 follows from Corollary 25 and (48). 
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5 Normality via Stein's Method 

In this section we will use Stein's Method to prove that Af(Hd(n,p)) as well as Sg tend (after suitable 
normalization) in distribution to the normal distribution. This proofs Proposition 6 as well as Theorem 1 and 
Lemma 14. First we will define a general setting for using Stein's Method with random hypergraphs which 
defines some conditions the random variables have to fulfill. Then we show in two lemmas (Lemma 28 
and Lemma 29) that the random variables corresponding to Af(Hd (n,p)) and Sq do indeed comply to the 
conditions and last but not least a quite technical part will show how to derive the limiting distribution from 
the conditions. 

5.1 Stein's method for random hypergraphs 

Let £ be the set of all subsets of size dof V = {1, . . . , n}, and let H be the power set of £. Moreover, let < 
Pe < lforeache 6 £, and define a probability distribution on TL by letting P [H] = YieeH Pe'Yiee£\H 1 — 
p e . That is H € H can be considered a random hypergraph with "individual" edge probabilities. 

Furthermore, let A be a family of subsets of V, and let (Y a ) ae _A be a family of random variables. 
Remember that for Q C V we set £{Q) = {e £ £ : eflQ / 0}. We say that Y a is feasible if the following 
holds. 

For any two elements H,H' eH such that H n £(a) = H' n £{a) we have Y a (H) = Y a (if')- 

That means Y a is feasible if its value depends only on edges having at least one endpoint in a. In addition, 
set Y c f{H) = Y a (H \ £{S)) (H e H, a € A, S C V, S n a = 0). Thus Y£(H) is the value of Y a after 
removing all edges incident with S. We define 

Y=Y,Y a , Ma=E[F Q ], (7 2 =Var[y], X a = (Y a - fi a )/a (54) 

r7 ry . rj -\ f Yfj if Of H (3 j= , 

Za=}_^Z a0 , whereZ^=a * [ Y/3 -Y$ if an [3 = <b, (55) 
V a p= J2 K/ a + E (Yj-Y^)/a, and (56) 

-y-.pn-yyt® 7:/3ri-y = 

5=J2 E [\ X <*\ Z Z]+ J2 (E[\X a Z af3 V a p\]+E[\X a Z al3 \}E[\Z a + V af3 \}). (57) 

aS^l a,f3EA 

The following theorem was proven for graphs (i.e. d = 2) in [3]. The argument used there carries over 
to the case of hypergraphs without essential modifications. Thus for the sake of brevity we omit a detailed 
proof of this result. 

Theorem 26. Suppose that all Y a are feasible. If 5 = o(l) as n — ► oo, then Y ~ E \- Y ] converges to the 
standard normal distribution. 

Now the following lemma states that a number of conditions on the expectations of the product of up to 
three random variables Y~ will suffice for <5 = o(l). The conditions are identical for both statements we 
want to prove and we will prove that they are fulfilled in both cases in the next two sections while the proof 
of the lemma itself is deferred to the end of the section. 

Lemma 27. Let k = 0(polylog?i) and let (Ya)oi&A be a feasible family such that < Y a < kfor all 
a G A. If the following six conditions are satisfied, then S = o(l) as n — > oo. 

Yl. WehaveE{Y),Va,T{Y) = 0(n), and^ eA:f3na ^ ftp = 0(E(r)/n-polylogn) = 0(polylogn)./ r 
any a £ A 

Y2. Let a, (3, 7 be distinct elements of A. Then 

Y a (Yp - Yj?)Y$ = if a n (3 = 0, (58) 
Y a Y p = if a n (3 ? 0, (59) 
{Y p - Y?)Yf = (Yf, - YfiYy = if a n (3 = a n 7 = ^ P n 7. (60) 
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Y3. Fora/Za,/3vve/iaveX; 7:7 n/3#0,7nQ=0 E ( r /3 r 7 Q ) < fc V/3- 
Y4. If a, j3 <E A are disjoint, then 

E \Y a Yp] = 0{pL a ^j3 ■ polylogn), 

E[\Yf,-Y?\] =0(^ -polylogn), 

M n 

E [y a |F - y^l] = • polylogn). 

Y5. If a, j3, 7 G .4 are pairwise disjoint, then 

E - Y^|] = 0(»l • polylogn), 

E [\Y P -Y£\ ■ |Y° - Y^\] = 0(» • polylogn), 



e [y a |y^ - y«| • |y; - y^i] = o{ 

E[Y a \Y /3 -Yp a \.\Y 1 -Y«\}=0( 



polylogn), 
polylogn), 



E[|(y^-y^)(y 7 -y; 

Y6. If a, /3, 7 G A satisfy an/3 = an7 = 0, then 



0(^£- polylogn). 



Efly^-y^Q =0(^ -polylogn 



(61) 
(62) 

(63) 

(64) 
(65) 
(66) 
(67) 
(68) 

(69) 



5.2 Conditions for the normality of Af(Hd(n, p)) 

In this section we will prove the properties Y1-Y6 defined in Lemma 27 for the case of the normality of 

N(H d (n,p)). 

Let k = O(polylogn) and let A = {a C V : 1 < \a\ < k}. Moreover, for A C V with AC\a = $ 
let IQ = 1 if a is a component of H\ £(A), and otherwise. Further, set Y^ = |a| • J^. We briefly write 
I a = 7* and Y, = Y^. Then (y,) Q g^t is a feasible family, because whether a is a component or not only 
depends on the presence of edges that contain at least one vertex of a. 

Let C(S) denote the even that the subhypergraph of H induced on S C V is connected. If I a = 1, then 
C(a) occurs. Moreover, H contains no edges joining a and V \ a, i.e., H n £(cx, V \ a) = 0. Since each 
edge occurs in H with probability p independently, we thus obtain 



P[/ a = l]=P[C(a)](l-p) 



|£(a,V\a)| 



Furthermore, observe that 

VaeA,AcBcV\a:l£ = l->l£ = l. 
Proof of Yl: We know from Theorem 1 that E [Y] = <9(n) and Var [Y] = <9(n). To see that 

M/3 = 0(E [y]/n • polylogn), 



(70) 



(71) 



note that /ip := E [Yg] depends only on the size of (3. Thus with /it = p,p for an arbitrary set j3 of 
size b we have E [Y] = J^peA^P 



EiLiE^x M/3 = Eb=i(fc)M6 while E/3e^na#0 HP 



\p\= h 

Proof of Y2: (58): Suppose that I a = 1, Then Jf features no edge that contains a vertex in a and a 
vertex in (3. If in addition 72 = 1, then we obtain that Jg = 1 as well. Hence, Yp = Yg. 
(59): This just means that any two components of H are either disjoint or equal. 



21 



(60): To show that Y 1 (Yp — Yp) = 0, assume that 7 7 = 1. Then 7 is a component of H, so that (3 
cannot be a component, because 7 7^ (3 but 7 n /3 7^ 0; hence, Ip = 0. Furthermore, if 7 is a component of 
H, then 7 is also a component of H \ £(a), so that I" = 1. Consequently, Ip* = 0. Thus, Yp = Yp a = 0. 

In order to prove that Y"(Yp — Yp) = 0, suppose that I" = 1. Then Ip = 0, because the intersecting 
sets (3, 7 cannot both be components of H\£(a) . Therefore, we also have Ip = 0; for if (3 were a component 
of H , then /3 would also be a component of H \ £(a). Hence, also in this case we obtain Yp = Yp = 0. 

Proof of Y3: Suppose that Ip = 1, i.e., (3 is a component of H. Then removing the edges £(a) from 
H may cause /3 to split into several components B\, . . . , B\. Thus, if > for some 7 e i such that 
7 n /3 7^ 0, then 7 is one of the components B\, . . . , 7?;. Since I < \(3\ < k, this implies that given Ip = 1 
we have the bound E 7 : 7 n/3^0, 7 no:=0 Y" < k 2 . Hence, we obtain Y3. 

The following lemma which gives a description of the limited dependence between the random vari- 
ables I a and Ip for disjoint a and (3 together with the fact that P [I a = 1] = 0(/i Q ) implies Y4-Y6. 

Lemma 28. Let < I, r < 2, a«of let ai, . . . , ai, @i, . . . , r € A be pairwise disjoint. Moreover, let 
A\, . . . , A ri £?i, . . . , B r C y fee sefs such that Ai C i?i C V \ Pi and \Bj\ < 2k for all 1 < i < r, anrf 
assume that fY i=1 Bi \ Ai ~ 0. 77ie« 



A- A '•• 1 ^ - 

i=i 3=1 



< 0(n- r • polylogn) J] P [7 Qi = 1] [] P [7 ft = l] . 
3=1 3=1 



Prao/ Since (71) entails that 7^ 7^ if/ <-> if J = 1 A Igf = 0, we have 



'ft 



'ft 



ft 

Vi, j : 7 Q 



1A7 



ft 



OA/ 



ft 



1 



(72) 



We shall bound the probability on the right hand side in terms of mutually independent events. 

If I ai = 1 and Ip 3 = 1 for all then the hypergraphs induced on a, and are connected, i.e., 
the events C(ai) and C((3j) occur. Note that these events are mutually independent, because C(oj) (resp. 
C((3j)) only depends on the presence of edges e e £ (cti) \ £(V \ Q4) (resp. e C £(/??) \ £(V \ (3j)). 

Furthermore, if a, is a component, then in H there occur no edges joining Oj and V\di, in other words, 
Hn£(ai, V\ai) However, these events are not necessarily independent, because £(a±, V\ai) may 
contain edges that are incident with vertices in 02- Therefore, we consider the sets 



T(cti) = (J a,-U \JpjUBj, V{a i ) = £{a i ,V\a i )\£{F{a i )), 



3=1 



HPi) = (J a * u U ^' u U V ^i) = £ ^^ V \ ft) \ £ Wj))- 
i=1 j'Vj i' =1 

Then 7 aj = 1 (resp. 7^ = 1) implies that £>(«,) n 77 = (resp. £>(/%) n 77 = 0). Moreover, since 
the sets T>{on) and T>(j3j) are pairwise disjoint, the events T>{oti) H i7 = 0, T>((3i) n 7/ = are mutually 
independent. 

Finally, we need to express the fact that 7g 3 =0 but 7g 3 = 1, If this event occurs, then H contains 
an edge connecting (3j with Bj \ Aj, i.e., H n £(/3j, -Bj \ Aj) 7^ 0. Thus, let Q denote the event that 
H n £{f3 ] ,B j \ Aj) jt for all 1 < j < r. 

Thus, we obtain 



< P 



P[Vi,j:7 ai =1A7^ 3 = 0A/J =1 

/\(C(oi) A (X>(oi) n 77 = 0)) A f\ (CCS,-) A (D(&) n 77 = 0)) A Q 



3=1 



= H P [C(oi)] P p?(oi) n 77 = 0] x H P [COS,-)] P PiPi) n 77 = 0] x P [Q] 

i=i j=i 



(73) 
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We shall prove below that 

p [v(oti) n H = 0] - p [£>(/%) ni? = 0] ~ (74) 

P[Q] = 0(n- r -polylogn). (75) 

Combining (70) and (72)-(75), we then obtain the assertion. 

To establish (74), note that by definition T>(oii) C £ (a,, V \ a.;). Therefore, 

P [T>(cn) n H = 0] = (1 - > (1 - p )|£(«i.n«i)l . (76) 

On the other hand, we have |oj|, |.F(aj)| = O (poly log n), and thus \£(cei,!F(ai))\ < \cti\ ■ \J-(ai)\ ■ 
(d-2) = 0{n d ~ 2 ■ polylogn). Hence, as p = 0(n 1_d ), we obtain 

P p3(oi) nff = 0] = (1 -p)W<**)\ < (i- p )\£(<*i>v\ ai )\-\s( ai ,H<xi))\ 

~ (1 - p)|£(«*.v\«»0l exp ( p . o( n d-2 . polylogn)) - (1 _ p )l«(««.v\aOI. (77) 

Combining (76) and (77), we conclude that P [X>(o») n F = 0] ~ (1 - p)|£(a»>vV«)l. As the same argu- 
ment applies to P [X>(/3j) D H = 0], we thus obtain (74). 

Finally, we prove (75). If r = 1, then i7 contains an edge of £((3i, £>i \ Ai). Since 

\£((3 1 ,B 1 \A 1 )\ < ■ -n d ~ 2 = 0(n d ~ 2 • polylogn), 

and because each possible edge occurs with probability p independently, the probability of this event is 
P [Q] < 0(n d ~ 2 ■ polylog n)p = 0(n _1 ■ polylogn), as desired. 

Now, assume that r = 2. Then H features edges e 3 £ Bj \ Aj) (j = 1, 2). 

1st case: e x = e 2 . In this case, e\ contains a vertex of each of the four sets (3\, (3 2 , B\ \ A\, B 2 \ A 2 . 
Hence, the number of possible such edg es is < n d - 4 nj = i \ ■ \Bj \ A 3 \ = 0(n d - 4 • polylogn). 
Consequently, the probability that such an edge occurs in H is < 0(n d ~ 4 ■ polylog n)p = 0(n~ 3 ■ 
polylogn). 

2nd case: e\ ^ e 2 . There are < \/3j \-\Bj\Aj\- n d ~ 2 = 0(n d ~ 2 ■ polylog n) ways to choose ej (J = 1, 2). 
Hence, the probability that such edges ei, e 2 occur in H is < [0(n d ~ 2 ■ polylog n)p] 2 = 0(n~ 2 ■ 
polylogn). 

Thus, in both cases we obtain the bound claimed in (75). □ 



5.3 Conditions for the normality of Sg 

In this section we will prove the properties Y1-Y6 defined in Lemma 27 for the case of the normality of 
S G . 

Consider a set G C V of size m. Let A be the set of all subsets a C V \ G of size \a\ < k. Moreover, 
let p e = p for e C V \ G, p e = p 2 for e G £(G, V \ G), and p e = if e C G. 

For ACV and A n a = set l£ = 1 if a is a component of H \ £ (A U G). Moreover, let J A = 1 if 
(iZ \ £ (A)) n £(G, a) ^ 0. Further, let = l£ j£ and Y Q A = \a\K£. Then 

P [Jf„ = 1] = «(P [J„ = 1]). (78) 

Proof of Yl: Using Lemma 10 we have E [Y] = 6>(n) and using Lemma 1 1 we have Var [Y] = 0(n). 
The proof of the rest of Yl is analogous to the proof of Yl in the case of Af(Hd(n,p)). 

Proof of Y2: (58): Suppose that K a = 1. Then I a = 1, so that H \ £{G) has no a-/3-edges. Hence, if 
also Kp = 1, then (3 is a component of H \ £(G) as well. Thus, Kp = 1, so that Yp = Yg*. 

(59) : If K a = 1, then a is a component of H \£(G). Since any two components of H \£(G) are either 
disjoint or equal, we obtain Ip = 0, so that Yp = as well. 

(60) : To show that Y 1 {Yp — Yp) = 0, assume that K 1 = 1. Then 7 7 = 1, i.e., 7 is a component of 
H \ £(G). Since (3 7^ 7 but (3 n 7 7^ 0, we conclude that 7^ = 0. Furthermore, if 7 is a component of 
H \ £{G), then 7 is also a component of H \ £{G U a), whence Ip = 0. Consequently, Yp — Yp = 0. 
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In order to prove that Y^(Yp - Yg Q ) = 0, suppose that K™ = 1. Then K° = 1. Therefore, J| = 0, 
because the intersecting sets (3, 7 cannot both be components of H \ £{a). Thus, we also have Ip = 0; for 
if (3 were a component of H, then (3 would also be a component of H \ £(a). Hence, also in this case we 
obtain Yp = Y« = 0. 

Proof of Y3: Suppose that Kp = 1. Then Ip = 1, i.e., (3 is a component of H \ £(G). Then removing 
the edges £ a from H \ £(G) may cause (3 to split into several components Bi, . . . , B\. Thus, if Y^ > 
for some 7 G ^4 such that 7 0/3^0, then 7 is one of the components £>i, . . . , Since i < |/?| < fc, this 
implies that given Ip = 1 we have the bound 



E 

7: 7 n/3^0,7na=f 



r 7 Q < fc 2 



Hence, we obtain Y3. 

Similar to Lemma 28 the following lemma on the limited dependence of K a and Kp for disjoint a and 
P implies Y4-Y6. 

Lemma 29. Let < I, r < 2, a«c/ let oti, . . . , oti, Pi, . . . , r € A be pairwise disjoint. Moreover, let 
Af, . . . , A,., Si, . . . , B r C 7 sefs smc/z f/zaf i, C 5; C V \ /3j ant/ < O(l) /or alll < i < r, and 
assume that (Xi=i Bi \ Ai = 0. Then 



i=i j=i 



< 0( ? i- r ■ polylogn) [J P = 1] II P t X ft = J ] 



Prao/ Let P = P 



Vi, j : A' Qs =\KK^+K B p{\. If < J ,4 < J , then either l£ ± 1% or if 



Pi 



Ip* = 1 and J£ ^ J^ 3 . Therefore, letting J = {j : I£ ^ } and J = {1, . . . , r} \ J, we obtain 



7> < P 



A'" 1 A'" ' A(/ lA 4 

«=1 j'6.7 



(79) 



Now, the random variables I ai , Ip 3 , and Ip 3 are determined just by the edges in £ \ £{G), while J* 3 

and Jp 3 depend only on the edges in £{G). Hence, as the edges in £ \ £{G) and in £{G) occur in H 
independently, (79) yields 



V < P 



A'- 1 ' A /l 1 • A'" - l l 



A-'. 4 • 



(80) 



Furthermore, Lemma 28 entails that 



A'- 1 A'" 1 ' A'.' • l l 

- 1 jej j&J 



In addition, we shall prove below that 



< 0(n-W -polylogn)- JJ P [7 Qi = 1] JJ P = l] 



(81) 



< 0(n ! J ! • polylogn). 



(82) 



Plugging (81) and (82) into (80), we get V < 0{ n - r • polylogn) • ]jLi p ft* = 1] ■ Uj=i P = l] » 
so that the assertion follows from (78). 

Thus, the remaining task is to establish (82). Let us first deal with the case \ J\ = 1, Let j € J . 
If Jp 3 7^ Jp 3 , then Jp 3 = 1 and = 0, because Aj C -Bj. Thus, /5y is connected to G via an 
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edge that is incident with Aj \ Bf, that is, H n £{(5 3 ,Bj \Aj) ^ 0. Since there are \E(/3j,Bj \Aj)\ < 
\/3j | • \Bj\ ■ n d ~ 2 = 0(n d ~ 2 ■ polylog n) such edges to choose from, and because each such edge is present 

with probability p 2 = C^n 1 ^), we conclude that P jfi ^ J^] < P [H n £(0j,Bj \ Aj) ^ 0] < 
0(n d ~ 2 ■ polylog n)p2 = 0(n~ l ■ polylog n), whence we obtain (82). 

Finally, suppose that \J\ = 2. If J g 3 ^ J g 3 for j = 1,2, then there occur edges ej € H n £{(3j,Bj \ 



Aj){j = 1,2). 

1st case: e-y = e 2 . In this case e\ = e 2 is incident with all four sets f3j, Bj \ Aj (J = 1, 2). Hence, as the 
number of such edges is < n d ~ 4 J^ =1 \Pj\'\Bj\Aj\ — 0(n d ~ 4 -polylog n) and each such edge occurs 
with probability p 2 = 0(n 1 ~ d ), the probability that the 1st case occurs is 0(n d ~ 4 ■ polylog n)p2 = 
0(n~ 3 ■ polylog n). 
2nd case: e\ ^ e 2 . There are < \(3j\-\Bj\Aj 



,d— 2 



< 0(n -polylog n) ways to choose ej for j = 1,2, 
each of which is present with probability p 2 = 0(n 1 ~ d ) independently. Hence, the probability that the 
second case occurs is bounded by \0{n d ~ 2 ■ polylogn)p 2 ] < 0(n~ 2 ■ polylogn). 



Thus, the bound (82) holds in both cases. 



□ 



5.4 Proof of Lemma 27 

All we need to show is that the conditions defined in Lemma 27 imply that S as defined by (57) tends to 0. 
We will do so by proving that each of the three summands contributing to S is 0(ct _3 E [Y] ■ polylogn). 
Together with condition Yl, stating that E [Y], a 2 = 0(n), this implies the statement. We formulate one 
lemma for each summand, bounding the expectations using conditions Y1-Y6. The proof of the lemmas 
are mainly long and technical computations then. 

Lemma 30. E*eA E [\X a \ Z 2 a ] = 0( CT - 3 E [Y] ■ polylogn) 

Proof. Let 



* = E E 

aeA 



Y n 



E * 



^ = E E 

aeA 



S, 



E E 



Y n 



E rc> 

,/3:an/3=0 



Si 



E E 



aeA 



E * 

E 0r P 

i/3:an/3=0 



Y a ) 



Since X a = (Y a — « Q )/er < (Y a + fx a )/a, (54) entails that 



E [\X a \Z 2 a ] < 2a~ 3 E 



{Y a + l-ia) 



<2a~ 6 {S 1 +S 2 




E * 

/3:arW0 



E w-y?) 

yl3:anf)=$ 



Therefore, it suffices to show that Sj = 0(E(Y) ■ polylogn) for j = 1, 2, 3, 4. 
Regarding Si, we obtain the bound 



(59) 



5 i = E E E E [YaYpY^] < k 2 E [Ya] < 0(E [Y] ■ polylogn). 

aeA /3:an/3^0 7:007/0 



aeA 



With respect to S 2 , note that due to (59) and (61) we have E [YpY 7 ] < kfip if /? = 7, E \YpY^] 
(3 7^ 7 but [3 n 7 ^ 0, and E [YpYy] = 0(ppp 7 ■ polylogn) if f3 n 7 = 0. Consequently, 

& = E E E tw] 

QG.A /3:an/3^0 7:007^0 



VI 



- E ^ Q E E °0"/3M-y • polylogn) < 0(E(y) • polylogn). 

/3:an/3^0 7:aH7^0 



Oif 



(83) 
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Concerning S3, we obtain 

s * = E E E E[r a (^-V)(r 7 -y«)] 

o£^ /3:an/3=0 7:007=0 

(67), (60) 

< E E E 0(^ Q ^ (3 /i 7 n~ 2 ■ polylogn) 

ctG.4 /3:aH/3=0 7:aH7=0 

Yl 

< C>(n" 2 • polylogn)E(F) 3 < 0(E(Y) • polylogn). 

To bound S4, we note that for disjoint a, f3 E A and 7 G A disjoint from a the conditions (62), (59), 
and (68) yield 

0(^f- polylogn) if [3 = 7 

if/3^7,/3n7^0 
0(Mgx .polylogn) if/?n 7 = 0. 



Therefore, 



E E E We ~ Y e)^ - <EE ■ p 01 ^) + E °^ ■ p 01 ^) 



/3:an/3=0 7:aH7=0 



,36*476-4 /3G.4 
/„2 



/V 



< 0(E(y)7n 2 • polylogn) + 0(E(Y)/n ■ polylogn) 
= O (polylogn). 



Hence, we obtain S 4 < J2 a eA M« E/3 :Q rW0 E 7 :an7#0 



E 



(Y„ - K?)(Y 7 - r«) < o(e [y]- P oi y iogn). 



Lemma 31. E [l^a^^O = 0(<j- 3 E [Y] ■ polylogn) 



□ 



Then the def- 



Proof. Let Si = E [|X a Y^|] and S 2 = £^=0 E [ X a (Y fi - Y$)V af > 

inition (55) of Z a p yields that Eae.4 E/3g.A E [\X a Z a pV a p\] < a" 1 (Si + S2) Hence, it suffices to show 
that Si,S 2 = 0(er- 2 E [Y] ■ polylogn). 

To bound Si, we note that Y a Yp = if a n ^ but a ^ (3 by (59), and that V a p — if a = [3 by 
the definition (56) of V a p. Thus, if a n (3 ^ 0, then 



(54) 



E[\X a Yf,V a f,\] < <J- l ^[{Y a +^ a )\YpV a p\]<a~ 1 ^[\YpV aP \}. 



Furthermore, 

Ti(a,f3) 



Y7 



E E [\YpYf] <fcV 
7:70/3^0,700=0 



(84) 



(85) 



(64) 



T ^) = E E e[^|f«-f 7 ^|] < E E °( 



'M/3M7 



polylog n) 



i3:an/3^0 7:i9n 7 =0 



< 0(n 1 ■ polylogn) 



Yl 



/3:an/3^0 7:i9n 7 =0 

AaH7=0 



Eft 

7G.4 



E ^ 

/3:an/3#0 



< 0(n _1 E(Y) • polylogn) = O(polylogn). 
Combining (84)-(86), we get 



(86) 



(56) 



Si < a- 1 J2 E ^ E i\ y p v <*p\\ ^ a ~ 2 E 

Qe^/3:an^#0 



T 2 (a) + E Tl ( a " 9 ) 

/3:arW0 



< 0(a 2 • polylogn) 



E(Y) + k 2 J2 A*/J 

/3:an/3^0 



Yl 



< 0(cr- 2 E(F) - polylogn) 
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To bound 5*2, let a, (3 € A be disjoint. As X a < (Y a + /j, a )/<r, we obtain 

E[\x a (Y p -Y^)v aP \] < a- 1 E[|(r Q + AiQ )(y /3 -y /3 a )F Q/3 |] 

(56), (60) 

< a- 2 E[|(r Q + MQ )(y /3 -y /3 a )r«|] 

+a- 2 J2 V[\(Y a +n a m-Y£)(Y«-Y^)\} 

A«ri7 = 

< a- 2 {Ti+T2 + T^ + Ti), 

where 

Tl = E [\Y a (Y - Y?)Y?\], T 2 = /x a E [|(Y^ - ^ , 

T 3 = E E W " F /?)( y 7 Q - y 7 QU/3 )0 ' T 4 = E E [I ^ " ^^(^ " • 

7:/3n-y = 71/307 = 

Aq(17 = AaH7 = 

Now, Ti = by (58). Moreover, bounding T 2 by (62), T 3 by (66) and T 4 by (65), we obtain 

a 2 E [\X a {Y p - l?)^] < 0(» • polylogn) + £ . polylogn) 

-Y:/3n^ = 
Aari7 = 

= C( • polylogn). 

n 

Thus, (87) yields S 2 < a' 2 Ep :an p=$ O(^-polylogn) = Ofa-V"^^) 2 -polylogn) = 0(a- 2 E(F)- 
polylog n) , as desired. □ 

Lemma 32. E^E^E [|X„2 a/J |]E [\Z a + V a/3 \] = 0(a~ 3 E [Y] • polylogn) 

Proof. Since |crX a | < Y a + fj, a , 

EE E [l^^l] E [1^ + ^1] <^(EE IMP 3 [|Z a |] + E [\V af} \]) + 

aeAPeA a^AP^A 

E [Y a \Z a(3 \}(E [\Z a \] + E [|^/3|])).(87) 

Furthermore, we have the three estimates 

<tE[|z q |] < ( i> E ^+ E v[\ y p- y pW 

peA p-.anp^® p-.anp=® 

(62). Yl 

< E °(«"V/3 • polylogn) = O(polylogn), (88) 
Pea 

^E[\v a0 \] ( < E E [|^|] + E E[|y«-y 7 ^|] 

7:/3ri7^0 7:/3ri7 = 

AaH7 = AaPl7 = 

(69^yi 0( n -i M7 . polylogn) < O(polylogn), (89) 
76.4 

E^E[r Q |z Q/3 |] ( = } E E[y a r /3 ]+ E E [y Q \y p - y?\] 

PeA P:anP^tl /3:an/3=0 

(59), (63) V"^ r>/ i i \ mm 

= fc/i Q + > = 0(a* q • polylogn). (90) 

* — ' n 

p:anp=1l 
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Now, (88)-(90) yield 



£ ^2v a E[\Z af3 \}(E[\Z a \} + E[\V af} \}) 




'a 



a£A fl&A 



a£A 



E E E [ y « I^IKE [\Z a \] + E [\V aP \]) 



0{a- 2 E[Y] -polylogn), 
0(cr" 2 • polylogn) /i, 



'CK 



(91) 



a£A 



0(<j- 2 E[Y] ■ polylogn). 



(92) 



Combining (87), (91), and (92), we obtain the assertion. 



□ 



Finally, Lemma 27 is an immediate consequence of Lemmas 30-32. 

6 Conclusion 

Using a purely probabilistic approach, we have established a local limit theorem for Af(Hd(n,p)). This 
result has a number of interesting consequences, which we derive in a follow-up paper [4]. Namely, via 
Fourier analysis the univariate local limit theorem (Theorem 2) can be transformed into a bivariate one that 
describes the joint distribution of the order and the number of edges of the largest component. Furthermore, 
since given its number of vertices and edges the largest component is a uniformly distributed connected 
graph, this bivariate limit theorem yields an asymptotic formula for the number of connected hypergraphs 
with a given number of vertices and edges. Thus, we can solved an involved enumerative problem ("how 
many connected hypergraphs with v vertices and fi edges exist?") via a purely probabilistic approach. 

The techniques that we have presented in the present paper appear rather generic and may apply to 
further related problems. For instance, it seems possible to extend our proof of Theorem 2 to the regime 
c = ( jZi)p = (d — 1) _1 + o(l). In addition, it would be interesting to see whether our techniques can be 
used to obtain limit theorems for the fc-core of a random graph, or for the largest component of a random 
digraph. 
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